2024-07-12 13:55:07.AIbase.10.2k
Zhipu AI Unveils Open-Source Video Understanding Model CogVLM2-Video
Intelligent Spectrum AI has announced the open-source upgrade of the CogVLM2-Video model, a significant advancement in the field of video understanding. The CogVLM2-Video model addresses the limitations of existing video understanding models in handling missing temporal information by introducing multi-frame video images and timestamps as input to the encoder. Utilizing an automated method for constructing time localization data, it has generated 30,000 temporal-related video question and answer